3D Human Body-Part Tracking and Action Classification Using A Hierarchical Body Model

نویسندگان

  • Leonid M. Raskin
  • Michael Rudzsky
  • Ehud Rivlin
چکیده

Human body pose tracking is a challenging task for several reasons. The large variety of poses and high dimensionality of the human model complicates the examination of the entire subject and makes it harder to detect each body part separately. However, the poses can be presented in a low dimensional space using the dimensionality reduction techniques. Such a reduction improves the trackers robustness, ability to recover from temporary target loss, and the computational effectiveness. In this paper we introduce a Hierarchical Annealing Particle Filter (H-APF) tracker which exploits the Hierarchical Human Body Model in order to achieve accurate body part estimates. We apply a nonlinear dimensionality reduction using the Hierarchical Gaussian Process Latent Variable Model (HGPLVM) [1] and the Annealing Particle Filter (APF) [2] is used for the propagation between the sequential frames. A hierarchical model of the human body expresses conditional dependencies between the body parts and allows us to capture properties of separate parts. The human body model consists of two independent parts: one containing information about 3D location and orientation of the body and the other describing the articulation of the body. The articulation is represented as a hierarchy of body parts. The method uses previously observed poses from various motion types to generate mapping functions from the lowdimensional latent spaces to the data space that describes the poses. The tracking algorithm consists of two stages. First, the particles are generated in the latent space and are transformed to the data space using the learned mapping functions. Second, rotation and translation parameters are added to obtain valid poses. Finally, the likelihood function is calculated in order to evaluate how well these poses match the image. The resulting tracker estimates the locations in the latent spaces that represent poses with the highest likelihood. We demonstrate that our tracker is capable of robust tracking the human poses even in the scenarios that envolves several different motions. Our method performs action classification in the latent spaces, produced by HGPLVM. A pose estimated on each frame corresponds to a coordinate in the latent space. Therefore, an action is represented by a curve in this latent space. The classification of the motion is based on the comparison of the sequences of latent coordinates that the tracker produces to the sequences that represent different actions (we denote such sequences as models). The modified Frèchet distance [3] is used in order to perform the comparison. This approach allows for the introduction of actions different from those used for the learning of the latent spaces by exploiting the model that represents it. We also show that the action classification, when performed in the latent space, is robust and has a high accuracy rate. We tested H-APF tracker using the HumanEvaI and HumanEvaII [4] and some home-made datasets. The sequences contain different activities, such as walking, boxing,and jogging, which were captured by several (four) synchronized and calibrated cameras. The sequences were captured using the MoCap system that provides the correct 3D locations of the body joints, such as shoulders and knees. This information is used for an evaluation of the results and a comparison to other tracking algorithms. The error is calculated, based on a comparison of the tracker’s output to the ground truth, using the average distance in millimeters between 3-D joint locations [5]. We trained HGPLVM with several different motion types and used this latent space in order to track the body parts on the videos from the HumanEvaI and HumanEvaII datasets. Figure 1 shows the result of the tracking of the HumanEvaII(S2) dataset which combines three different behaviors: walking, jogging and balancing. Do demonstrade the ability of the classification algorithm we used several datasets. One of them includes five different activities: (1) hand waving, (2) lifting an object, (3) kicking, (4) sitting down, and (5) punchframe 230 frame 640 frame 700 frame 800 Figure 1: Tracking results of H-APF tracker. Sample frames from the combo1 sequence from HumanEvaII(S2) dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RASKIN, RUDZSKY, RIVLIN: BODY-PART TRACKING AND ACTION CLASSIFICATION 1 3D Human Body-Part Tracking and Action Classification Using a Hierarchical Body Model

This paper presents a framework for hierarchical 3D articulated human body-part tracking and action classification. We introduce a Hierarchical Annealing Particle Filter (H-APF) algorithm, which applies nonlinear dimensionality reduction of the high dimensional data space to the low dimensional latent spaces combined with the dynamic motion model and the Hierarchical Human Body Model. The impro...

متن کامل

تخمین چنددوربینی حالت سه بعدی انسان با برازش افکنش مدل اسکلت سه بعدی مفصل دار در تصاویر سایه نما

Automatic capture and analysis of human motion, based on images or video is important issue in computer vision due to the vast number of applications in animation, surveillance, biomechanics, Human Computer Interaction, entertainment and game industry. In these applications, it is clear that 3D human pose estimation is an essential part. Therefore, its accuracy has a great effect on the perform...

متن کامل

A Dynamic 3D Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space

We propose a novel framework for a hybrid 2D-3D dynamic human model with which robust matching and tracking of a 3D skeleton model of a human body among multiple views can be performed. We describe a method that measures the image ambiguity at each view. The 3D skeleton model and the correspondence between the model and its 2D images are learnt using hierarchical principal component analysis. T...

متن کامل

A Dynamic Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space

We propose a novel framework for a hybrid 2D-3D dynamic human model with which robust matching and tracking of a 3D skeleton model of a human body among multiple views can be performed. We describe a method that measures the image ambiguity at each view. The 3D skeleton model and the correspondence between the model and its 2D images are learnt using hierarchical principal component analysis. T...

متن کامل

Tracking Hybrid 2D-3D Human Models from Multiple Views

A novel framework is proposed under which robust matching and tracking of a 3D skeleton model of a human body from multiple views can be performed. We propose a method for measuring the ambiguity of 2D measurements provided by each view. The ambiguity measurement is then used for selecting the best view for the most accurate match and tracking. A hybrid 2D-3D representation is chosen for modell...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009